arXiv: 1506.08361vl [cs.ET] 28 Jun 2015 


Simultaneously Solving Computational Problems Using 

an Artificial Chemical Reactor 


Jaderick R Pabico 
Institute of Computer Science 
Coiiege of Arts and Sciences 
University of the Phiiippines Los Banos 
Coiiege 4031, Laguna, Phiiippines 
63-49-536-2313 
jppabico@uplb.edu.ph 


ABSTRACT 

This paper is centered on using chemical reaction as a 
computational metaphor for simultaneously solving prob¬ 
lems. An artificial chemical reactor that can simultaneously 
solve instances of three unrelated problems was created. 
The reactor is a distributed stochastic algorithm that sim¬ 
ulates a chemical universe wherein the molecular species 
are being represented either by a human genomic contig 
panel, a Hamiltonian cycle, or an aircraft landing schedule. 
The chemical universe is governed by reactions that can 
alter genomic sequences, re-order Hamiltonian cycles, or 
reschedule an aircraft landing program. Molecular masses 
were considered as measures of goodness of solutions, and 
represented radiation hybrid (RH) vector similarities, costs 
of Hamiltonian cycles, and penalty costs for landing an 
aircraft before and after target landing times. This method, 
tested by solving in tandem with deterministic algorithms, 
has been shown to find quality solutions in finding the 
minima RH vector similarities of genomic data, minima 
costs in Hamiltonian cycles of the traveling salesman, and 
minima costs for landing aircrafts before or after target 
landing times. 
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1. INTRODUCTION 

Combinatorial optimization problems such as traveling 
salesman problem (TSP), job-shop scheduling, vehicle 
routing, scheduling of aircraft landing, gene sequencing, 
and many others are problems whose solutions are of real- 
world importance. Exact algorithms have been proposed 
to these problems but prove inefficient for large problem 
instances (i.e., they are NP-hard) [17]. Graph-based heuris¬ 
tics such as branch and bound [27], as well as distributed 
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multi-agent based algorithms such as genetic algorithms m, 
memetic algorithms | 24lll3lfT4] . tabu search [28], simulated 
annealing m, simulated jumping [3], neural networks [23], 
and swarm intelligence [1511161 111] have been used to find 
time-restrained optimal and near optimal solutions for these 
problems. 

In recent years, different researchers have shown that the 
chemical systems of living organisms possess inherent com¬ 
putational properties [l8l[U|l]. Because of these, the chem¬ 
ical metaphor has emerged as a computational paradigm [9] 
[n m [M [6] llOj . Under this computational framework, 
objects such as atoms or molecules are considered as data 
or solutions, while interactions (i.e., molecular collisions or 
reactions) among objects are defined by an algorithm. 

Using the chemical metaphor, a distributed stochastic 
algorithm was designed to simulate a reactor where the 
molecules are being represented either by a human genomic 
contig panel, a Hamiltonian cycle, or an aircraft landing 
schedule. The chemical universe in the reactor is governed 
by reactions that can alter genomic sequences, re-order 
Hamiltonian cycles, or schedule an aircraft landing pro¬ 
gram. This computational paradigm can be used to solve, 
in parallel, very hard real-world problems. 

In this effort, an artificial chemical reactor (ACR) simulates 
chemical catalysis that will solve in parallel three NP-hard 
computational problems. These problems are the construc¬ 
tion and integration of RH map of the human genome, the 
solution to large instances of symmetric and asymmetric 
TSP, and the static aircraft landing scheduling problems 
(ALSP). The artificial chemical reactor was used as a compu¬ 
tational metaphor for constructing RH maps from different 
RH panels and integrating them to produce a single RH 
map for the genome. The ACR simulation of catalytic reac¬ 
tions constructs RH maps with the same quality as those 
constructed by RHMAPPER [26] and CONCORDE [2]. In 
addition, ACR was also used to find, in parallel, solutions 
to the TSP and the ALSP. 

2. DEVELOPMENT OF ARTIFICIAL 
CHEMICAL REACTOR 

This section briefly defines the three problems used in this 
study: the RH mapping problem, the TSP, and the ALSP. 
The development of the ACR is then discussed, while its 



underlying reaction algorithms defined, with a focus on 
solving the three problems in parallel. 

2.1 RH Mapping Problem 

The RH mapping problem can be viewed as follows. Given 
a collection of DNAQ fragments originating from identical 
copies of a chromosome, where some of the fragments are 
overlapping while others are disjoint. The problem is to 
derive the true DNA sequence of the chromosome from these 
DNA fragments. Since the DNA sequence of the chromo¬ 
some is unknown, a metric is needed to quantify the cor¬ 
rectness of a candidate sequence. A reasonable measure is 
to find the shortest DNA sequence that has all the fragments 
as subsequence [20]. This problem is generally known as the 
shortest common supersequence (SCS). 

Given a finite alphabet E, where in this case |E| = 4 per¬ 
taining to the DNA bases adenine, cytosine, guanine, and 
thymine, and a finite set of strings R from the superset S*. 
R can be seen as the set of RH panels. The solution to 
SCS is a string to € E* such that each string y £ R is a 
subsequence of w (i.e. one can get y by taking away letters 
from w) [21]. In RH mapping problem, any string w is a 
DNA sequence. The minimum length w will be the optimal 
solution to the RH mapping problem. 

2.2 Traveling Salesman Problem 

The TSP is formally defined as the problem of finding the 
shortest Hamiltonian cycle of a graph G{V, E) composed of 
a set of cities V = {va,V 2 , ■ ■ ■ ,Vn}, and a path set E = 
{{vi,Vj) : Vi,Vj € V}. Associated with G is a cost matrix C 
where each element Ci.j G G is the cost measure associated 
with path (vi,Vj) £ E. A Hamiltonian cycle is a closed tour 
that visits each city Vi £ V once. A symmetric TSP is when 
Cij' = Cj,i, otherwise it is an asymmetric TSP. The solution 
to the TSP is a Hamiltonian tour with the minimum cost /„ 
(Equation [T| . 

n—1 

fv = Cn,i + where n = number of cities (1) 

i = l 

2.3 Aircraft Landing Scheduling Problem 

The ALSP is the problem of deciding a landing time for a set 
of aircrafts P such that the total penalty cost fp for landing 
earlier or later than a target time is minimized. Given, for 
each aircraft pi £ P are: 

1. the earliest landing time a; 

2. the latest landing time h; 

3. the target landing time tp, 

4. the penalty cost per unit time, Qi, for landing before 

5. the penalty cost per unit time, hi, for landing after ti', 

6. the required separation time between pi landing and 
Pj landing Sij (where pi lands before Pj and pj £ P); 

7. the unknown landing time xt; and 


8 . a dependency variable Sij, where Sij = 1 if pi lands 
before Pj and 0 otherwise. 

The ALSP is formally defined in the mathematical program¬ 
ming formulation [311] shown in Equation [2] 

fp = Yf}i^i{9i max( 0 , U - Xi) + hi max( 0 , Xi - U)} 
s.t. Sij +Sji = l,j > r,i,j = 1,...,|P| 

Xj >Xi + SijSij - {k - ei)Sij, i = = 1,..., |P| 

6 i ^ Xi ^ li, i — 1 , . . . , |P|. 

( 2 ) 

2.4 Artificial Chemical Reactor 

The artificial chemical reactor (ACR) is defined by a triple 
ACR{M, R,A). M is a set of artificial molecules, P is a set 
of reaction rules describing the interaction among molecules, 
and A is an algorithm driving the reactor. In this paper, the 
molecules in M are DNA sequences, Hamiltonian tours, or 
aircraft landing schedules. The rules in R are reordering 
algorithms that create new molecules when two molecules 
collide. The algorithm A describes how the rules are applied 
to a vessel of artificial molecules simulating a well-stirred, 
topology-less reactor. A partitions the reactor into different 
levels of reaction activities. The level of reaction activity is 
a function of molecular mass. 

The RH strings Wi G E* (Vi = 1,..., |E* |), the set of all per¬ 
mutations n(H) of the \V\ cities in V) of the TSP, and the 
set of all permutations n(P) of the |P| aircrafts in P in the 
ALSP were considered as molecules. A string Wi encodes a 
DNA sequence that solves the RH mapping problem, a per¬ 
mutation nv encodes a Hamiltonian cycle that solves the 
TSP, and a permutation np encodes a landing schedule that 
solves the ALSP. The width |wi| of Wi, the cost /„ (Equa¬ 
tion [3 of traversing a specific nv, and the cost fp (Equa- 
tion[ 3 ) of scheduling a specific np were considered as molec¬ 
ular mass. 

If two molecules nii and m 2 collide and they encode solu¬ 
tions to the same problem, they react following a zero-order 
catalytic reactions of the form 

mi -I- m 2 -I- G —> m 3 -I- m4, 

where rrii are molecules (Vi = 1,..., 4) and G is a catalyst. 
The reaction can be mathematically thought of as a function 

Ri : M X M — M X M, 

where rrii £ M. Ri performs reordering of solutions as 
described by the following algorithm (let n be the length 
of the DNA sequence in the RH mapping problem, or the 
number of cities in the TSP, or the number of aircrafts in 
the ALSP - atom is taken as either a DNA base, a city, or 
an aircraft): 

1. Let an integer I £ [l,n] be the index of the Ith atom 
in any molecule m. Let i = 1. 

2. Take a random integer between 1 and n and assign it 
to 1. Let = 1. 


3. Taking the reactant rm, locate the Ith atom and move 
it as the Zth atom for mi+ 2 . 


^deoxyribonucleic acid 



4. Take note of the Ith atom in mi +2 and locate it in rm. 
Replace I with the value of the index of the atom in 

rrii. 

5. Repeat steps 3 to 4 until the 1th atom in mi +2 is the 
same as the l°th atom in rm. 

6 . For all indeces I with no atoms yet in mi+ 2 , move the 
1 th atom from reactant mi as the 1 th atom in product 
mi+ 2 . 

7. Repeat steps 2 to 6 for i = 2. 

If two molecules ms and me collide and they encode solu¬ 
tions to different problems, they react following a zero-order 
catalytic reaction of the form 

ms -I- me —>■ mj + mg. 

The reaction follows a mathematical function 
R 2 ■■ M X M —5> M X M 
and is described by the following algorithm: 

1. Let i = 5. 

2. Take molecule mi and mark the point of collision as 1. 

3. Take the 1th atom in mi and swap it with the 1 -|- 1th 
atom in mi. If 1 = n, swap the 1th atom with the first 
atom, instead. 

4. The resulting molecule is the product mi +2 

5. Repeat steps 2 to 4 for i = 6 . 

If a molecule mg hits the bottom or walls of the reactor, a 
zero-order catalytic reaction of the form 

mg -I- C —>■ mio 

happens. The reaction is a mathematical function 
Rg-.M —> M 
described by the following algorithm: 

1. Mark the point of collision in mg as 1. 

2. Take the 1th atom in mg and swap it with the 1 -I- fcth 
atom. With a probability > 0.5, assign fe = 1, else 
k = -1. 

3. If 1 = n, swap the 1th atom with the first atom instead 
(for k = 1). 

4. If 1 = 1, swap the 1th atom with the nth atom instead 
(for k = — 1 ). 

The reactor algorithm A operates on a universe of molecules 
S = {mi,..., m|s|}, l^l << \M\. The development of S is 
realized by applying the following algorithm: 

1. Initialize S with l^l molecules selected randomly from 
M. 


2. Using stochastic sampling with replacement, select two 
molecules mi and m 2 from S without removing them. 

3. Apply the reaction rule Ri if mi and m 2 encode 
solutions to the same problem. Otherwise, apply R 2 
instead to get the products m 3 and mi. 

4. Apply the reaction rule Rg for heavy molecules that 
collide with the reactor walls and bottom. 

5. Decay the heavier molecules by removing them out of S 
and replacing them with randomly selected molecules 
from M. 

6 . Repeat steps 2 to 5 until S is saturated with lighter 
molecules. 

One iteration of A constitutes one epoch in the artificial 
reactor. The sampling procedure gives molecules with low 
molecular mass a higher probability to react or collide with 
other molecules. This mimics the level of excitation energy 
the molecule needs to overcome for it to react with another 
molecule. This means that the lighter the molecule, the 
higher the chance that it will collide with other molecules. 
Step 6 of algorithm A requires a metric for measuring sat¬ 
uration of molecules. In this study, when the number of 
molecules in that level of excitation has reached 90% of the 
total molecules encoding the same problem, the ACR will 
stop applying the reaction rules for the same problem and 
will consider it solved while continuing the simulation for 
the remaining problems. 

3. RESULTS AND DISCUSSION 

Using the same datasets and parameters from results on 
recently published papers on RH mapping [2], TSP |16 |. 
and ALSP [ 8 ], the ACR was run to solve the three prob¬ 
lems in parallel. A single-processor Pentium IV machine 
with 1.2GHz bus speed running under a multiprogramming 
operating system was used to run the ACR simulations. 
The ACR simulation was repeated 10 times while each of 
the problem’s metrics (i.e., the best minimum for each run) 
were recorded. The values recorded were averaged and the 
standard deviation computed. The results of the runs were 
compared to those of the recent literature. The comparison 
are summarized in tables. 

3.1 RH Mapping 

Tables[T]and[2]show the average obligate chromosome breaks 
per marker for the GB4 and G3 RH panels, respectively, as 
found by RHMAPPER and CONCORDE (both reported 
by 12 ]) and ACR. The tables show the averages of 10 runs 
for each chromosome for ACR and the respective standard 
deviations. The tables show that the result found by ACR is 
not different from the results found by either RHMAPPER 
or CONCORDE. 

3.2 Symmetric and Asymmetric TSPs 

Table [3] compares the average tour lengths found by ACR, 
simulated annealing, and self-organizing maps on five sets 
of random instances of symmetric 50-city TSPs. The table 
shows the average value of 10 runs for ACR and the respec¬ 
tive standard deviation. Table U] on the other hand, com¬ 
pares the best integer tour length found by ACR and genetic 


Table 1: The average obligate chromosome breaks 
per marker for the GB4 panel as found by 
RHMAPPER, CONCORDE and ACR. The ACR 
values are averaged over 10 runs while the values in 
parenthesis are the respective standard deviation. 


Chromosome 

Number 

RHMAPPER 

CONCORDE 

ACR 

(std. dev.) 

1 

- 

1.66 

1.88 (0.20) 

2 

3.80 

2.12 

2.56 (0.64) 

3 

2.71 

1.97 

1.80 (0.85) 

4 

3.75 

2.15 

3.22 (0.55) 

5 

3.37 

1.99 

2.67 (0.38) 

6 

2.60 

1.70 

2.60 (0.80) 

7 

2.86 

1.92 

1.88 (0.90) 

8 

3.64 

2.09 

2.88 (0.47) 

9 

2.86 

1.85 

2.88 (0.99) 

10 

3.55 

2.04 

1.99 (0.55) 

11 

2.53 

1.86 

1.95 (0.64) 

12 

3.87 

1.98 

3.27 (0.85) 

13 

2.92 

2.01 

2.46 (0.85) 

14 

2.43 

1.79 

2.15 (0.59) 

15 

4.16 

2.25 

3.00 (0.88) 

16 

3.18 

2.32 

2.74 (0.78) 

17 

2.74 

2.03 

2.00 (0.67) 

18 

3.07 

2.47 

2.20 (0.83) 

19 

2.78 

1.99 

2.66 (0.74) 

20 

2.41 

1.74 

2.50 (0.70) 

21 

2.64 

2.19 

2.33 (0.41) 

22 

2.87 

2.17 

2.33 (0.64) 

23 

2.36 

1.70 

2.36 (0.66) 


algorithm on four examples of asymmetric instances of TSP. 
Based on the data presented in both tables, it can be seen 
that ACR’s performance is similar to the performances of 
the other search techniques employed by other researchers. 

3.3 Scheduling Aircraft Landings 

Table [5] shows the performance of ACR as compared to the 
optimal solutions of eight instances of aircraft landing prob¬ 
lems. From the data presented, it can be seen that ACR 
was able to find near optimal solutions for six out of eight 
problems, and the exact solution for two out of eight prob¬ 
lems. 

4. CONCLUDING REMARKS 

An ACR that mimicked catalytic reactions was designed to 
simultaneously solve three unrelated problems. The cat¬ 
alytic reactions of the ACR were able to construct RH maps 
with similar qualities found by deterministic algorithms. In 
addition, ACR was found to be applicable in finding solu¬ 
tions to TSPs and ALSPs in parallel with RH mapping. 
Thus, ACR may be used to simultaneously solve hard prob¬ 
lems such as the RH mapping, TSP, and ALSP. 

The work described in this paper can be extended as follows. 
In order to assess its efficiency, additional experiments are 
needed with the ACR simulating more than three problems 
simultaneously. This experiment will answer the question: 
”Up to how many problems will the ACR be still efficient?” 
Using a single-processor machine, as the number of problems 
being solved simultaneously is increased, it is expected that 


Table 2: The average obligate chromosome breaks 
per marker for the C3 panel as found by 
RHMAPPER, CONCORDE and ACR. The ACR 
values are averaged over 10 runs while the values in 
parenthesis are the respective standard deviation. 


Chromosome 

Number 

RHMAPPER 

CONCORDE 

ACR 

(std. dev.) 

1 

- 

2.91 

2.88 (0.06) 

2 

4.96 

2.88 

2.92 (0.02) 

3 

5.18 

3.13 

2.10 (0.03) 

4 

5.52 

2.96 

4.21 (0.47) 

5 

5.17 

3.09 

3.52 (0.98) 

6 

4.76 

3.11 

2.66 (0.01) 

7 

5.91 

3.69 

2.25 (0.65) 

8 

5.34 

3.09 

3.36 (0.22) 

9 

4.73 

3.16 

3.33 (0.55) 

10 

5.35 

3.47 

2.81 (0.54) 

11 

5.79 

3.24 

2.70 (0.06) 

12 

5.31 

3.32 

3.60 (0.90) 

13 

4.58 

3.17 

2.86 (0.69) 

14 

4.04 

2.93 

2.81 (0.20) 

15 

4.70 

3.76 

3.19 (0.50) 

16 

5.04 

3.49 

2.75 (0.26) 

17 

4.39 

3.69 

2.01 (0.32) 

18 

6.10 

3.88 

2.49 (0.56) 

19 

4.95 

3.23 

2.82 (0.10) 

20 

4.87 

3.70 

3.15 (0.65) 

21 

3.79 

3.36 

2.42 (0.32) 

22 

4.21 

3.41 

3.31 (0.88) 

23 

4.35 

2.80 

2.84 (0.47) 


the ACR will suffer considerably in terms of efficiency. Thus, 
further investigations on the use of multi-processor machines 
are needed to achieve better ACR performance. 
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Table 3: Comparison of average tour length 

found by ACR, simulated annealing (SA), and self¬ 
organizing maps (SOM) on five sets of random 
instances of symmetric 50—city TSPs. ACR values 
are averaged over 10 runs. The values in parenthesis 
are the standard deviation of the 10 runs for ACR. 


Problem 

ACR (std. dev.) 

SA 
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5.87 (0.33) 
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2 

6.15 (0.08) 
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5.59 (0.21) 
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5.81 

5.87 

5 
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6.33 

6.70 

















Table 4: Comparison of the best integer tour length 
found by ACR and genetic algorithm (GA) on four 
examples of asymmetric instances of TSPs. 


Problem 

ACR 

GA 

Oliver 30 

421 

421 

Eil50 

424 

428 

Eil75 

550 

545 

KrotlOO 

21,280 

21,761 


Table 5: Comparison between the optimal cost of 
scheduling aircraft landings and those found by ACR 
on eight instances of ALSP. 


Problem 

Number 

Number of 
Aircrafts 

Number of 
Runways 

Optimal 

Solution 

ACR 

1 

10 

1 

700 

721 

2 

10 

2 

90 

94 

3 

20 

1 

1,480 

1,463 

4 

20 

2 

210 

220 

5 

30 

1 

24,442 

24,536 

6 

30 

2 

554 

554 

7 

50 

1 

1,950 

2,001 

8 

50 

2 

135 

135 
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